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Abstract — A spatially correlated broadcast setting with M 
antennas at the base station and M users (each with a single 
antenna) is considered. We assume that the users have perfect 
channel information about their links and the base station has 
only statistical information about each user's link. The base 
station employs a linear beamforming strategy with one spatial 
eigen-mode allocated to each user. The goal of this work is 
to understand the structure of the beamforming vectors that 
maximize the ergodic sum-rate achieved by treating interference 
as noise. In the M = 2 case, we first fix the beamforming 
vectors and compute the ergodic sum-rate in closed-form as a 
function of the channel statistics. We then show that the optimal 
beamforming vectors are the dominant generalized eigenvectors 
of the covariance matrices of the two links. It is difficult to 
obtain intuition on the structure of the optimal beamforming 
vectors for M > 2 due to the complicated nature of the sum- 
rate expression. Nevertheless, in the case of asymptotic M, we 
show that the optimal beamforming vectors have to satisfy a set 
of fixed-point equations. 

I. Introduction 

The focus of this paper is on a MISO broadcast (downlink) 
setting where the base station (BS) has M antennas with M 
users in the cell, each having a single antenna. Under the 
assumption of perfect channel state information (CSI) at both 
the ends, significant progress has been made over the last 
few years on understanding optimal signaling that achieves 
the sum-capacity Q~|-||5l as well as the capacity region |6| 
of the multi-antenna broadcast channel. Though the capacity- 
achieving dirty paper coding scheme is well-understood, the 
complexity associated with it makes it an impractical choice. 
Thus, recent focus has been on a family of linear precoding 
schemes [7]-|9| which are within a fixed power-offset of the 
dirty paper coding scheme. In particular, a linear beamforming 
scheme that allocates one eigen-mode to each user is of 
considerable interest in standardization efforts. 

More importantly, while reasonably accurate CSI can be 
obtained at the users via pilot-based schemes, CSI at the BS re- 
quires either channel reciprocity or reverse link feedback, both 
of which put an overwhelming burden on the operating cost. 
Thus, there has been a significant interest on understanding 
the information-theoretic limits of broadcast channels under 
practical assumptions on CSI. In the extreme case of no CSI 
at the BS, the multiplexing gain possible in the perfect CSI 
case (M) is lost completely as it reduces to one. 

The no CSI assumption is pessimistic and in practice, the 
channel evolves fairly slowly on a statistical scale and it is 
possible to learn the statistics of the individual links at the 
BS with minimal cost. In the MISO broadcast setting with 
a Rayleigh fading model for each user (zero mean complex 
Gaussian fading process), the complete channel statistics are 
specified by the covariance matrix of the vector channel of 
the user. In this context, it must be noted that initial works 
assume that all the users experience fading that is independent 
and identically distributed (i.i.d.) across the antennas. That is, 
the covariance matrix of each user is the identity matrix. This 



assumption cannot be justified in practice unless the antennas 
at the BS are spaced wide apart and the scattering environment 
connecting the BS with the users is rich. While the correlated 
case has been studied in the literature ifTUl . ifTTl . the general 
version of the problem studied here has not received much 
attention. 

The focus of this work is on understanding the impact of the 
users' spatial statistics (their covariance matrices) on the sum- 
rate performance of the linear beamforming scheme. We first 
study the simplest non-trivial case of M — 2 and compute the 
sum-rate achievable with a linear beamforming scheme under 
the practical assumption that interference is treated as noise. 
For this, we exploit knowledge of the structure of density 
function of the weighted norm of isotropically distributed 
beamforming vectors fl2l . Our sum-rate characterization is 
explicit and in terms of the covariance matrices of the two 
users and the beamforming vectors. 

While identifying the structure of the sum-rate optimizing 
beamforming vectors is a difficult problem, in general, we 
obtain intuition in the low- and the high-SNR extremes. In the 
low-SNR extreme, it is not surprising that a strategy where 
the BS beamforms along the dominant eigen-mode of each 
user's channel is sum-rate optimal. In the high-SNR extreme, 
a strategy where the BS beamforms to a given user along the 
dominant generalized eigenvectotQ of that user's and the other 
user's covariance matrices is sum-rate optimal. Intuitively 
speaking, given that the BS has only statistical information 
of the two links, it generates an "effective" covariance matrix 
for a particular user by statistically pre-nulling the interference 
from the forward channel of the user. The sum-rate optimal 
beamforming vectors are the dominant eigen-modes of these 
effective covariance matrices. Solutions in terms of the gen- 
eralized eigenvectors are obtained in the perfect CSI case |9l , 
ff3l . but to the best of our knowledge, this solution in the 
statistical case is a first. While the generalization of this result 
to the M > 2 case is cumbersome, simple approximations 
for the ergodic sum-rate in terms of the channel statistics are 
provided in the asymptotics of M, Based on these approxi- 
mations, we show that the optimal beamforming vectors are 
solutions to a set of fixed-point equations. 
Note: Due to space constraints, the proofs of the main state- 
ments in this paper are not provided and the logic of the main 
arguments are sketched out in brief. 

II. System Setup 

We consider a broadcast setting with M antennas at the 
base station (BS) and M users, each with a single antenna. 
We denote the M x 1 channel between the BS and user i as 

1 A generalized eigenvector x (with the corresponding generalized eigen- 
value a) of a pair of matrices (A, B) satisfies the relationship A x = a B x. 
In the special case where B is invertible, a generalized eigenvector of the pair 
(A, B) is an eigenvector of B X A. If A and B are also positive definite, 
then all the generalized eigenvalues are also positive. 



hi, i = 1, • • • , M, While different multi-user communication 
strategies can be considered, as motivated in the Introduction, 
the focus here is on a linear beamforming scheme where the 
BS beamforms the information-bearing signal Si meant for 
user i with the M x 1 unit-normed vector w 8 . We assume that 
Si is unit energy and the BS divides its power budget of p 
equally across all the users. The received symbol y,; at user i 
is written as 



Vi 



i = i, 



,M 



where denotes the CAf(0, 1) complex Gaussian noise added 
at the receiver. 

We assume a Rayleigh fading (zero mean complex Gaus- 
sian) model for the channel and hence, the complete spatial 
statistics are described by the second-order moments of {h^}. 
With M antennas at the BS and a single antenna at each user, 
the channel h, of user i can be generically written as 



(1) 



where hnd,i is an M x 1 vector with i.i.d. CJ\f(0, 1) entries 
and Ei is the covariance matrix corresponding to the user i. 
In particular, with E^ = 1m for all users, (Q~|l reduces to the 
i.i.d. downlink model well-studied in the literature. 

The metric of interest in this work is the throughput from the 
BS to the users. Under the assumption of Gaussian inputs {si}, 
the instantaneous information-theoretic^ rate, Ri, achievable 
by user i with the linear beamforming scheme and using a 
mismatched decoder is given by 



Ri = log 1 



1 + M ^j^i 
M 
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In particular, the ergodic sum-rate achievable with the linear 
beamforming scheme is given by 



i=l 



With the spatial correlation model assumed in (|T), we can 
write 7; i as 



A I 



l0g(l + £:.h^ 4 V. i A 4 Vfh iid , 4 ) , 



where we have used the following eigen-decomposition in the 
second equation: 




Aj = diag^Ai, i, • • • , A 4j M ]j , Aj, x > • • • > A h M > 

2 A11 rate quantities will be assumed to be in nats/s/Hz in this work. 



Similarly, we can write 7^ 2 as 

4 2 =log(l + ^-hf d!l V ! A,Vfh iid , 



V,A. t Vf = S I 1 / 2 |E™?I S ' /2 







diag [Aj 1, 



, Aj «] ] , Afi > 



> A, 



> 0. 



Towards the goal of computing the ergodic rates, we expand 
hi id. i into its magnitude and directional components as hi id. j = 
||hiid,i|| • hiid, i- Note that ||hnd,i|| 2 can be written as 



2iU 



Liid, i 



3 = 1 



where is a standard (real) chi-squared random variable and 
hiid, i is a unit-normed vector that is isotropically distributed 
on the surface of M -dimensional complex sphere. Thus, we 
can rewrite 1 and 2 as 
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Further, since the magnitude and directional information of an 
i.i.d. (isotropically distributed) random vector are independent, 

E [7j i 1] and E [7^ 2] can be written as 

E\I iA ] - E [log (l + ^ ■ llhid,,:!! 2 ' Aihiid.i) 



£[7i, 2 ] =E log(l + ^ 



h i (: ., 1 ■ h'' , A, hiid, i 



where we have also used the fact that a fixeci unitary trans- 
formation of an isotropically distributed vector on the surface 
' of the complex sphere does not alter its distribution. 

III. Ergodic Sum-Rate: Two User Case 

The focus of this section is on computing the ergodic 
information-theoretic rates in closed-form in the special case 
of two users (A7 = 2). This closed-form expression will be 
a function of the covariance matrices of the two users, Sj 
and S 2 , and the choice of beamforming vectors, wi and 
w 2 . Once a closed-form expression is obtained, our goal lies 
in characterizing the structure of the optimal beamforming 
vectors as a function of the channel statistics and SNR. 

For simplicity, we assume that 

Ei = UdiagQA, A a ]) U H , S 2 = Udiag([ m M2 ]) U H (2) 

where U = [ui(Si), u 3 (Si)], U = [ui(Sa), u 2 (E 2 )], 
Ai > A 2 > and /ii > // 2 > (that is, both Ei and E 2 
are positive definite). Define the condition numbers K\ and K2 

as 



A ^1 i A Mi 

Ki = — and K2 = — . 
A 2 



(3) 



3 Note that the unitary transformation is independent of the channel realiza- 
tion when the beamforming vectors are chosen based on long-term statistics 
of the channel. 



Proposition 1: The ergodic information-theoretic rate 
achievable at user i (where i = 1, 2) with linear beamforming 
in the two user case is given by 

E[R i ]=E[I itl ]-E[I it2 ] 

2 \ A . _ pA,- 5 TP. I 2 



A; i • e pAi . 1 Ei 



Ai 2 • e pAi - 2 Ei 



Ai,i - A ii2 

- exp (2/pAi, i) E 1 (2/pki,i 

where Ei(x) = f°° is the exponential integral. The 

corresponding eigenvalues can be written in terms of Ej and 
the beamforming vectors as follows: 



A M 
A, 2 



A,+E t - ^/(A-B^+ACf 
2 





where Ai — wf^XjWj, i?i = w^S^Wj and Ci = jw^SjWjl 
with j ^ i and = 1, 2. 

Proof: Note that a closed-form computation of E 1 [1^ i] 
requires the density function of weighted norms of isotropi- 
cally distributed unit-normed vectors since 



E [I it i] = E x 



log(l + Xy) ?i{y)dy 



where X denotes the random variable X = § • 1 1 h^. ; 1 1 2 and X 
corresponds to a realization of X. Let Y denote the random 
variable 

2 



A,h; 



iid, i 



*j 3 



hiid,i(j) 



The Ritz-Rayleigh relationship implies that Aj, 2 < Y < A^ 1 
and the density function of Y evaluated at y is denoted as 
Pi(y). In [12 1, Pi(y) in the M = 2 case is shown to be 
uniform, that is, 

p *(y) = n. — — - A *.2 < v < Ai,i. 



1 



The statement of the proposition follows from a routine 
computation via the integral tables fl4l . ■ 
Note that understanding the structure of the optimal choice 
of beamforming vectors, (wi. op t,W2 )0 pt), that maximize the 
ergodic sum-rate as a function of Si, X2 and p is a hard 
problem, in general. Therefore, we consider the low- and the 
high-SNR extremes to obtain insights. 

Low-SNR Extreme: We need the following characterization 
of the exponential integral: 



1 1 , 

< - log 

+ 2 ~ 2 B 



1 



< Ei(x)e x < log 1 



< 



where the extremal inequalities are established by using the 
fact that jrpj < log(l + x) < x. Note that the upper and the 
lower bounds get tight as x — ¥ 00 (or p — > in this context). 
Using the above bound, we have 



E[R i ] p ^ P -{K,x+K,2-B i ) = P -- M = P - -wf 



In the low-SNR regime, the system is noise-limited and hence, 
the linear scaling of E [Ri] with SNR. It is also straightforward 
to note that maximizing E [Ri] is contingent on optimizing 
over Wi alone. Thus, the sum-rate is maximized by 

wi, opt = ui(Si) and w 2 , pt = Ui(S 2 ) 

where Ui(£;) denotes the dominant eigenvector (an eigenvec- 
tor corresponding to the dominant eigenvalue) of Xj. In other 
words, in the low-SNR extreme, each user signals along the 
optimal statistical eigen-mode of its channel (and ignoring the 
other user's channel completely). This conclusion should not 
be entirely surprising. The resulting ergodic sum-rate is given 
as 

p^o p 



K 



^max(Si) + A max (S2) 



High-SNR Extreme: The following expansion of the expo- 
nential integral is useful in characterizing 1Z as p — > 00: 



Ei{x) 



log - 



E 

k=l 



(-l) fc+1 s 



ft ■ ft! 



7 



log - + x - 7 



where 7 0.577 is the Euler-Mascheroni constant. Using the 
above approximation, we have 

p^oc A,-, 1 log (Aj, 1) - A it 2 log (Aj, 2 ) 



E [Ri 



log (Si) 



The dominating impact of interference (due to the fixed nature 
of the linear beamforming scheme where the beamforming 
vectors are not adapted to the channel realizations) and the 
consequent boundedness of E [Ri] as SNR increases should 
not be surprising. After some elementary manipulation, we 
can write E [Ri] as 

'AiBi-CT 



2E [Ri 



p— >oo 



log 



log 



B? 



Ai + Bi 



A l +B l 



{Ai - Bi 



AC? 



{A, - Bt 



AC? 



Bi 



{A, - Bi 



AC? 



We now rewrite the high-SNR ergodic rates in a form that 
eases further study. 

Proposition 2: Define ds i (wi,W2) between two unit- 
normed vectors W] and w 2 from the Grassmann manifold 

5(2,1) as 



d-Zi (wi, w 2 ) = 



li(AiBi-Ci) 



(Ai + Btf 

where Ai, Bi and Cj are as in the statement of Prop. Q] 

• (a) Then, (w^w^) is a generalized "distance" semi- 
metricQ between W] and w 2 satisfying < djv (•, •) < 1. 

• (b) We can recast the ergodic rate in terms of 

(is, (wi,w 2 ) as 

p^oo 3(rfE,(wi,w 2 )) 



E [Ri] + log(2) 



loe 



Ai 

1 + a 



4 A semi-metric satisfies all the properties necessary for a distance metric, 
except the triangle inequality. 



where 



g(z) = f{z) + 2\og{z), 



VT^z 



1 , i + VT^ 2 

log 



1 - s/T^z 2 



(c) While /(•) is monotonically decreasing as a function 
of its argument, <?(•) is increasing with 

2 log(2) = lim g(z) < g(z) < lim g(z) = 2 

Z~ >0 Z— >1 

oo = lim f(z) > f(z) > lim f(z) = 2. 

z— fO 2— >1 



We are now prepared to illustrate the structure of the optimal 
beamforming vectors. 

Theorem 1: The optimal choice of the pair (wi opt , W2, op t) 
that maximizes E [Ri] in the high-SNR regime is 



» i, opt 



- e 0lJl ui (S^ and Wj-, op t = e 3 " 2 u 2 (Si) , j ^ i 



for some choice of z/j 6 [0, 27r), i — 1,2. 



Proq/: Let x (Si) 



Tsrf denote the condition 



A mi „(Si, 

number of S^. We first note that the optimization problem over 
the choice of a pair (wi,W2) that results in a corresponding 
choice of (Ai, Sj,Cj) can be recast in the form of a two 
parameter optimization problem over (Mj, Ni) with Mi = 
Ni = j£ under the constraint that < N 2 < M t < \ (£*)■ 
This results in the following high-SNR expression: 



2E 2 log(2) 



It is straightforward to show that the choice in the theorem 
maximizes the above equation. ■ 
With this choice of beamforming vectors, <fej("> ') an d E [Ri] 
can be written as 

flSi (W<, opt, Wj, pt) = 



Ki + 1 



for some choice of i>i £ [0, 2tt), i = 1,2. The optimal sum- 
rate is given as 

if Kl > K2 
if Kl < «2 



E + £ [i? 2 ] 



whereas lim E [RA is dependent on how the eigenvectors 

p— > oo 

of Si are related to Sj. It is also to be noted that E[Ri] 
increases (and rfE 4 (-, •) decreases) as Ki increases. That is, the 
more ill-conditioned S, is, the larger the high-SNR statistical 
beamforming rate asymptote and vice versa. This should be 
intuitive as our goal is only to maximize E [Ri] and the above 
choice achieves that goal. 

We now consider the sum-rate setting restricted to the case 
where Si and S2 have the same set of orthonormal eigenvec- 
tors. Instead of using the definitions of Si and S2 as in (01, 
for simplicity, we will assume that U = U = [ui, 112]. We 
define k,\ and K2 as in (0. Without loss in generality, we can 
also assume that n\ > 1. Three possibilities arise depending 
on the relationship between 1, «i and K2'- i) «i > 1 > K2, 
ii) Ki > K2 > 1, and iii) K2 > n\ > 1. (Note that first case 
subsumes the setting where /ii = [12 = /1 and S2 = fil.) The 
main result is the following theorem. 

Theorem 2: The sum-rate is maximized by the following 
choice of beamforming vectors: 

Wi, pt = e jVl Ui, w 2 , op t = e 3iy2 U2 if i) or ii) is true, 
wi, opt = e J " 2 u 2 , w 2 , op t = e^ui if iii) is true 



Ki ■ lQg(Kl) 

K\ — 1 
K 2 ' lQg(K 2 ) 



log(K 2 ) 

K 2 — 1 

log(«l) 

K-i —1 



Proof: The proof follows by decomposing wi and W2 
along the obvious orthogonal basis of {ui, U2}: 

wi = aui + /3u 2 , w 2 = 7U1 + <5u 2 

for some choice of {a,(3,j, 6} with a = jaje^" (similarly, 



for other quantities) satisfying |a| 



h\ 2 + \5\ 2 = l. 



A direct optimization of the high-SNR sum-rate expression 
shows that {6>.} enters the optimization only via the term \(3-f— 
aS\, which can be maximized by setting Q a + 6$ — 6p — 9 1 = it 
(modulo 2tt). Parameterizing |a| and I7I as |a| = sin(6>) and 
I7I = sin(</>) for some {8,4>} £ [0,7r/2], we can show that 
the sum-rate is maximized by 6 = it/ 2 and <fi — if i) or 
ii) is true and by 9 = and <j> — it/2 if iii) is true. For this, 
we establish an upper bound to the sum-rate and show that 
this bound is achieved by the choice as in the statement of the 
theorem. ■ 

We now consider the general case where Si and S2 do not 
have the same set of eigenvectors. 

Theorem 3: In the general case, the sum-rate is maximized 

wi,„pt = e^u^S^Si), w 2 , pt = e^u^Sf 1 S 2 ) 

for some choice of Vi £ [0, 2ir), i = 1, 2. 

Proof: For this case, we define S and its corresponding 
eigen-decomposition as 

S 4 Si £"* = V diag ([ m 772]) V H 

where V = [vi v 2 ] and 771 > 772. Since £2 is a full rank 
matrix and {vi,v 2 } form a basis, the vectors S 2 2 vi and 
S 2 2 V2 also form a basis (albeit non-orthogonal, in general). 
We can decompose wi and W2 along these vectors as 



W] = 



aS 2 2 vi +/3S, 2 v 2 



/3S 2 



w 2 = 



7S 2 2 vi + 5S 2 2 v 2 



-2 vi-rp~ 2 ? v 2 || || 7 S 2 2 vi +<5S 2 -v 2 | 

for some choice of {a,/3, 7, 6} with a = \a\e : * ec ' (similarly, 
for other quantities) satisfying \a\ 2 + \/3\ 2 = \^f\ 2 + \S\ 2 = 1. 
A suitable coordinate transformation at this stage results in 
an optimization problem that is related to the special case of 
Theorem 12 After this transformation, the proof follows along 
the same logic as in Theorem [2] ■ 
The reason for the peculiar choice of decomposition in the 
above proof (instead of decomposing the beamforming vectors 

along {vi,V2}) is that S 2 2 Vj, i — 1,2 turn out to be the 
dominant generalized eigenvectors of the pairs (Si, S 2 ) and 
(S 2 , Si), respectively. Recall from Footnote Q] the definition 
of a generalized eigenvector. For the above claim, note that 



S 2 1 Si 



V diag ([771 772]) V ff )Sf =MDM 



S-l 1 S2 — ( S 2 1 Si 



MD M 



where M = S 2 2 V and D = diag ([771 772]). Theorem [2] is 
indeed a special case of Theorem [3] For this, note that the 
dominant eigenvector of S^ 1 Si is Ui and U2 when K\ > K2 
and K2 < K\, respectively. 



IV. Ergodic Sum-Rate: General M Case 

A recent advance 03], [16] allows a computation of the 
density function of weighted sum of standard central chi- 
squared terms (generalized chi-squared random variables). 
Alternate to the approach of Prop. Q] this approach allows 
closed-form expressions in the general M case. For example, 
if Aj(j), j = 1, • • • ,M are distinct we have 

M M , . 



Xfc = exp 



fc=U=i,j^fc 
P 



Aj(fc)M 



Ei 



A t (k)M 



For E 1 [Jj t 2], replace Aj by Aj. It can be checked that this 
expression matches with the expression in the M = 2 case. 

Nevertheless, it is important to note that the formula above 
is in terms of the eigenvalue matrices {Aj, Aj, i = 1, • • • , M }, 
which become harder (and impossible for M > 5) to compute 
in closed-form as a function of the beamforming vectors and 
the covariance matrices as M increases. Approximation to 
the generalized chi-squared random variable by a Gamma 
distribution with matching first two moments can also be used 
to produce sum-rate approximations. However, these approx- 
imations are of similar complexity as the above formula. In 
contrast, we now provide asymptotic approximations to the 
sum-rate directly in terms of the relevant variables. 

Proposition 3: For any fixed p, the ergodic information- 
theoretic rate achievable at user i (where i — 1 , • • • , M) 
converges as M — » 00 to 



E[Ri] 
SINR, 



log(l + SINRi) =fti, 



A S% 



1 



wf HiVfj 



Proof: The proof follows along a law of large numbers- 
type argument, strengthened to convergence in mean via a 
suitable truncation technique. ■ 
Proposition 4: Based on the above expression, we have the 
following conclusions that mirror the main results of Sec. Hn] 
i) We have the following bound for oo'- 

M TP 

1 ■ max > wf SjW,- < Vr < 1- 

j=l M Z^i=l w i Zj 'i w i 

Thus, the optimal beamforming vectors as p — > are such that 
Wj, pt = ui(Sj), i = ,M, ii) For any p, we have 



< log 1 



^•Ai(Sj) 



M 



and T^i.oo is maximized by Wi, opt = Ui(Sj), and 

{wj, opt, i = 1,- •• , M, j ^ i} = {^(Ej), j = 2, • • • , M}. 

iii) X)j = i ^-i, 00 is optimized by the set of beamforming 
vectors that solve the following fixed-point equations: 



E01 i\i r 



lj • (1 + SINRj) V >7 • (1 + SINR,) 



SI NR., • S,-Wj 

J J =0, i = !,-■■ ,M. 



5 More complicated expressions can be obtained in case {Ai(j)} are not 
distinct. 



V. Conclusion 

We have studied statistics-based linear beamformer design 
for the MISO broadcast channel in this work. Based on a 
closed-form computation of the ergodic sum-rate in the M = 2 
(two-user) case, we provide intuition on the structure of the 
optimal beamforming vectors that maximize the sum-rate in 
the low- and the high-SNR extremes. While further intuition 
on the small M case seems difficult, in the asymptotics of 
M, we are able to obtain intuition on the structure of the 
optimal beamforming vectors. The case of optimal statistical 
linear beamforming design has not received much attention in 
the literature and our work sets the course for a systematic 
and low-complexity limited feedback design in the broadcast 
setting, which is of considerable importance in the standard- 
ization efforts. 
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